当今一些最先进的深度学习模型的出色表现在某种程度上是由于在大型数据集上进行了广泛的(自我)监督的对比预处理。相比之下,该网络是通过成对的正(相似)和负(不同的)数据点呈现的,并经过培训以找到每个数据点的嵌入向量,即一个表示形式,可以进一步调整各种下游任务。为了将这些模型安全地部署在关键的决策系统中,至关重要的是要使他们衡量其不确定性或可靠性。然而,由于训练对比模型的成对性质,并且在输出(抽象嵌入矢量)上缺乏绝对标签,因此将常规不确定性估计技术适应此类模型是不平凡的。在这项工作中,我们研究是否可以以有意义的方式量化此类表示形式的不确定性。换句话说,我们探索给定数据点上的下游性能是否可以直接从其预训练的嵌入中预测。我们表明,可以通过直接估算嵌入空间中训练数据的分布并考虑表示表示的局部一致性来实现此目标。我们的实验表明,嵌入向量的不确定性概念通常与其下游精度密切相关。
translated by 谷歌翻译
我们研究了$ \ Mathcal {r} $的结构和统计属性 - 规范最小化由特定目标函数标记的数据集的内侧插值。$ \ MATHCAL {R} $ - 标准是两层神经网络的电感偏差的基础,最近引入了捕获网络权重大小的功能效果,与网络宽度无关。我们发现,即使有适合数据的脊函数,这些插值也是本质上的多元功能,而且$ \ Mathcal {r} $ - 规范归纳偏见不足以实现某些学习问题的统计上最佳概括。总的来说,这些结果为与实际神经网络训练有关的感应偏见提供了新的启示。
translated by 谷歌翻译
The field of autonomous mobile robots has undergone dramatic advancements over the past decades. Despite achieving important milestones, several challenges are yet to be addressed. Aggregating the achievements of the robotic community as survey papers is vital to keep the track of current state-of-the-art and the challenges that must be tackled in the future. This paper tries to provide a comprehensive review of autonomous mobile robots covering topics such as sensor types, mobile robot platforms, simulation tools, path planning and following, sensor fusion methods, obstacle avoidance, and SLAM. The urge to present a survey paper is twofold. First, autonomous navigation field evolves fast so writing survey papers regularly is crucial to keep the research community well-aware of the current status of this field. Second, deep learning methods have revolutionized many fields including autonomous navigation. Therefore, it is necessary to give an appropriate treatment of the role of deep learning in autonomous navigation as well which is covered in this paper. Future works and research gaps will also be discussed.
translated by 谷歌翻译
Regularising the parameter matrices of neural networks is ubiquitous in training deep models. Typical regularisation approaches suggest initialising weights using small random values, and to penalise weights to promote sparsity. However, these widely used techniques may be less effective in certain scenarios. Here, we study the Koopman autoencoder model which includes an encoder, a Koopman operator layer, and a decoder. These models have been designed and dedicated to tackle physics-related problems with interpretable dynamics and an ability to incorporate physics-related constraints. However, the majority of existing work employs standard regularisation practices. In our work, we take a step toward augmenting Koopman autoencoders with initialisation and penalty schemes tailored for physics-related settings. Specifically, we propose the "eigeninit" initialisation scheme that samples initial Koopman operators from specific eigenvalue distributions. In addition, we suggest the "eigenloss" penalty scheme that penalises the eigenvalues of the Koopman operator during training. We demonstrate the utility of these schemes on two synthetic data sets: a driven pendulum and flow past a cylinder; and two real-world problems: ocean surface temperatures and cyclone wind fields. We find on these datasets that eigenloss and eigeninit improves the convergence rate by up to a factor of 5, and that they reduce the cumulative long-term prediction error by up to a factor of 3. Such a finding points to the utility of incorporating similar schemes as an inductive bias in other physics-related deep learning approaches.
translated by 谷歌翻译
This paper presents an algorithm that relies on a series of dense and deep neural networks for passive microwave retrieval of precipitation. The neural networks learn from coincidences of brightness temperatures from the Global Precipitation Measurement (GPM) Microwave Imager (GMI) with the active precipitating retrievals from the Dual-frequency Precipitation Radar (DPR) onboard GPM as well as those from the {CloudSat} Profiling Radar (CPR). The algorithm first detects the precipitation occurrence and phase and then estimates its rate, while conditioning the results to some key ancillary information including parameters related to cloud microphysical properties. The results indicate that we can reconstruct the DPR rainfall and CPR snowfall with a detection probability of more than 0.95 while the probability of a false alarm remains below 0.08 and 0.03, respectively. Conditioned to the occurrence of precipitation, the unbiased root mean squared error in estimation of rainfall (snowfall) rate using DPR (CPR) data is less than 0.8 (0.1) mm/hr over oceans and land. Beyond methodological developments, comparing the results with ERA5 reanalysis and official GPM products demonstrates that the uncertainty in global satellite snowfall retrievals continues to be large while there is a good agreement among rainfall products. Moreover, the results indicate that CPR active snowfall data can improve passive microwave estimates of global snowfall while the current CPR rainfall retrievals should only be used for detection and not estimation of rates.
translated by 谷歌翻译
Backpropagation is widely used to train artificial neural networks, but its relationship to synaptic plasticity in the brain is unknown. Some biological models of backpropagation rely on feedback projections that are symmetric with feedforward connections, but experiments do not corroborate the existence of such symmetric backward connectivity. Random feedback alignment offers an alternative model in which errors are propagated backward through fixed, random backward connections. This approach successfully trains shallow models, but learns slowly and does not perform well with deeper models or online learning. In this study, we develop a novel meta-plasticity approach to discover interpretable, biologically plausible plasticity rules that improve online learning performance with fixed random feedback connections. The resulting plasticity rules show improved online training of deep models in the low data regime. Our results highlight the potential of meta-plasticity to discover effective, interpretable learning rules satisfying biological constraints.
translated by 谷歌翻译
We consider a radio resource management (RRM) problem in a multi-user wireless network, where the goal is to optimize a network-wide utility function subject to constraints on the ergodic average performance of users. We propose a state-augmented parameterization for the RRM policy, where alongside the instantaneous network states, the RRM policy takes as input the set of dual variables corresponding to the constraints. We provide theoretical justification for the feasibility and near-optimality of the RRM decisions generated by the proposed state-augmented algorithm. Focusing on the power allocation problem with RRM policies parameterized by a graph neural network (GNN) and dual variables sampled from the dual descent dynamics, we numerically demonstrate that the proposed approach achieves a superior trade-off between mean, minimum, and 5th percentile rates than baseline methods.
translated by 谷歌翻译
Timely and effective response to humanitarian crises requires quick and accurate analysis of large amounts of text data - a process that can highly benefit from expert-assisted NLP systems trained on validated and annotated data in the humanitarian response domain. To enable creation of such NLP systems, we introduce and release HumSet, a novel and rich multilingual dataset of humanitarian response documents annotated by experts in the humanitarian response community. The dataset provides documents in three languages (English, French, Spanish) and covers a variety of humanitarian crises from 2018 to 2021 across the globe. For each document, HUMSET provides selected snippets (entries) as well as assigned classes to each entry annotated using common humanitarian information analysis frameworks. HUMSET also provides novel and challenging entry extraction and multi-label entry classification tasks. In this paper, we take a first step towards approaching these tasks and conduct a set of experiments on Pre-trained Language Models (PLM) to establish strong baselines for future research in this domain. The dataset is available at https://blog.thedeep.io/humset/.
translated by 谷歌翻译
随着人工智能的最新进展,可以在人类日常生活的各个方面看到其应用。从语音助手到移动医疗保健和自动驾驶,我们依靠AI方法的性能来完成许多关键任务;因此,必须以适当的手段进行预防损坏的方式主张模型的性能。通常,AI模型的短缺,尤其是深度机器学习,当面对数据分布的变化时,性能下降。尽管如此,在现实世界应用中始终期望这些转变。因此,已经出现了一个研究领域,重点是检测分布外数据子集并实现更全面的概括。此外,由于许多基于深度学习的模型在基准数据集上取得了近乎完美的结果,因此需要评估这些模型的可靠性和可靠性以推向现实世界应用程序的需求,这比以往任何时候都更加强烈。这引起了越来越多的研究领域的研究和领域的概括,这引起了对从各个角度比较这些研究进行比较的调查的需求,并突出了它们的平直和弱点。本文提出了一项调查,除了审查该领域的70多篇论文外,还提出了未来作品的挑战和方向,并为各种类型的数据转移和解决方案提供了统一的外观,以更好地泛化。
translated by 谷歌翻译
包括无人驾驶汽车(UAV)在内的自动移动机器人因其在建筑中的应用而受到了极大的关注。这些平台具有极大的潜力,可以自动化和增强许多任务所需数据的质量和频率,例如施工时间表更新,检查和监视。强大的本地化是可靠部署自动机器人平台的关键推动力。自动化的机器人解决方案主要依靠全球定位系统(GPS)进行户外定位。但是,GPS信号在室内被拒绝,并且经常使用预建的环境图来室内定位。这需要通过对环境中的移动机器人进行远程操作来产生高质量的地图。这种方法不仅耗时且乏味,而且在室内建筑环境中也是不可靠的。布局随着施工的进度而变化,需要频繁的映射会话来支持自主任务。此外,依赖视觉特征的基于视觉解决方案的有效性在现场低质地和重复区域都受到高度影响。为了应对这些挑战,我们以前提出了使用Apriltags的低成本,轻巧的基于标签的视觉惯性定位方法。在这种方法中,标签是具有已知尺寸和位置的纸张可打印地标,代表环境的准图。由于标签放置/更换是一个手动过程,因此它会遭受人体错误。在这项工作中,我们研究了人体错误在手动标签安装过程中的影响,并提出了一种随机方法,以使用谎言组理论来解决这种不确定性。使用蒙特卡洛模拟,我们通过实验表明,在我们的Manifold配方中纳入的拟议随机模型可提高基于标签的定位对在现场手动标签安装中不可避免的瑕疵的鲁棒性和准确性。
translated by 谷歌翻译